Extending Learning Vector Quantization for Classifying Data with Categorical Values
Authors
Abstract
Learning vector quantization (LVQ) is a supervised neural network method that applies to non-linear separation problems and is widely used for data classification. Existing LVQ algorithms focus mostly on numerical data. This paper presents a batch-type LVQ algorithm for classifying data with categorical values. The batch learning rules make it possible to construct the learning methodology for data in categorical, non-vector spaces. Experiments on UCI data sets demonstrate that the proposed algorithm effectively improves the capability of standard LVQ to handle data with categorical values.
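The abstract does not state the paper's update rules, so the following is only a rough sketch of what a batch LVQ-style pass over categorical data could look like, not the authors' algorithm. It assumes Hamming (simple matching) distance and a vote-based prototype update in which same-class samples count +1 and other-class samples count -1 for each attribute value. All function names and the toy data are hypothetical.

```python
from collections import Counter

def hamming(a, b):
    """Number of attributes on which two categorical records differ."""
    return sum(x != y for x, y in zip(a, b))

def nearest(protos, x):
    """Index of the prototype closest to x (ties go to the lowest index)."""
    return min(range(len(protos)), key=lambda j: hamming(protos[j][0], x))

def batch_step(protos, X, y):
    """One batch pass: assign all samples first, then update every prototype."""
    assigned = [[] for _ in protos]
    for x, label in zip(X, y):
        assigned[nearest(protos, x)].append((x, label))
    updated = []
    for (values, p_label), members in zip(protos, assigned):
        new_values = list(values)
        for k in range(len(values)):
            votes = Counter()
            for x, label in members:
                # Assumed rule: same-class samples attract, others repel.
                votes[x[k]] += 1 if label == p_label else -1
            if votes:
                best, score = max(votes.items(), key=lambda kv: kv[1])
                if score > 0:  # keep the old value unless a value wins on net
                    new_values[k] = best
        updated.append((tuple(new_values), p_label))
    return updated

def fit(protos, X, y, max_iter=20):
    """Repeat batch passes until the prototypes stop changing."""
    for _ in range(max_iter):
        new = batch_step(protos, X, y)
        if new == protos:
            break
        protos = new
    return protos

def predict(protos, x):
    """Label of the nearest prototype."""
    return protos[nearest(protos, x)][1]

# Toy data (made up for illustration): two attributes, two classes.
X = [("red", "round"), ("red", "oval"), ("green", "long"), ("yellow", "long")]
y = ["apple", "apple", "banana", "banana"]
init = [(("red", "long"), "apple"), (("green", "round"), "banana")]
model = fit(init, X, y)
```

Because prototypes are updated from counts of attribute values rather than by vector arithmetic, this kind of batch rule needs no numerical encoding of the categories, which is the property the abstract highlights.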
Similar resources
Comparison of classic regression methods with neural network and support vector machine in classifying groundwater resources
In the present era, classification of data is one of the most important issues in various sciences in order to detect and predict events. In statistics, the traditional view of these classifications will be based on classic methods and statistical models such as logistic regression. In the present era, known as the era of explosion of information, in most cases, we are faced with data that c...
NGTSOM: A Novel Data Clustering Algorithm Based on Game Theoretic and Self-Organizing Map
Identifying clusters is an important aspect of data analysis. This paper proposes a novel data clustering algorithm to increase the clustering accuracy. A novel game theoretic self-organizing map (NGTSOM) and neural gas (NG) are used in combination with Competitive Hebbian Learning (CHL) to improve the quality of the map and provide a better vector quantization (VQ) for clustering data. Different ...
Accelerating Competitive Learning Graph Quantization
Vector quantization (VQ) is a lossy data compression technique from signal processing for which simple competitive learning is one standard method to quantize patterns from the input space. Extending competitive learning VQ to the domain of graphs results in competitive learning for quantizing input graphs. In this contribution, we propose an accelerated version of competitive learning graph qua...
Adaptive Active Classification of Cell Assay Images
Classifying large datasets without any a-priori information poses a problem in many tasks. Especially in the field of bioinformatics, often huge unlabeled datasets have to be explored mostly manually by a biology expert. In this work we consider an application that is motivated by the development of high-throughput microscope screening cameras. These devices are able to produce hundreds of thou...
VQ-based written language identification
Humans can recognize different types of written languages by their grammars and vocabularies. However, computers see everything as numbers. We present a computational algorithm for machine classification of written languages using the method of vector quantization. For a language document, each word is converted to a sequence of numbers and forms a vector of numerical values according to its...